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DESCRIPTION 



Endo- ft -N-Acetylglucosaminidase Gene v 



Technical Field 

The present invention relates to a novel endo- ft -N-acetylglucosaminidase gene, and 
specifically relates to a gene where said gene is derived from the genus Mucor. Further, the 
present invention relates to a recombinant plasmid which comprises said gene, an organism 
transformed with said plasmid, and a method of producing a novel endo- ft -N- 
acetylglucosaminidase using said transformant. 

Background Art 

Glycoproteins are widely found in animal tissue, plant tissue, and the cell membrane and cell 
wall etc. of eukaryotic microorganisms. 

In recent years, it has become increasingly clear that the sugar chains of glycoproteins play an 
important role in mechanisms such as cell differentiation, carcinogenesis and intercellular 
recognition. In order to clarify these mechanisms, research into the correlation between sugar 
chain structure and its function is proceeding. As a means of achieving this objective, when 
cleaving a sugar chain from a glycoprotein or when identifying the structure of a sugar chain, 
various glycosidases are employed. Among these, endo- ft -N-acetylglucosaminidase acts on 
the asparagine-linked sugar chain (N-linked sugar chain, N-sugar chain) and has the action of 
cleaving the diacetyl-chitobiose portion that exists within the sugar chain thereby liberating 
the sugar chain. 

Since endo- ft -N-acetylglucosaminidase can liberate the sugar portion of a glycoprotein from 
the protein portion, it is thought to be important in the analysis of the function and structure of 
sugar chains in glycoproteins. 



Asparagine-linked sugar chains may be classified by their structure as high mannose type 
(mannane type sugar chain), hybrid type, or complex type. 

Known endo- 0 -N-acetylglucosaminidases include Endo H (A. L. Tarentino and F. Maley, J. 
Biol. Chem., 249, 811 (1974)), Endo F (K. Takegawa, et al., Eur. j. Biochem., 202, 175 
(1991)), and EndoA (K. Takegawa, et al., Appl. Environ. Microbiol., 55, 3107 (1989)). 
However, these enzymes only act upon sugars with specific structures, and do not act upon 
glycoproteins except in the presence of a denaturing agent. 

Endo-iS -N-acetylglucosaminidase derived from Mucor hiemalis is capable of cleaving tri- 
antennary complex type sugar chains in respect of not only high mannose type (mannane type 
sugar chain) and hybrid type, but also complex type chains. Further, with the asialylated type, 
cleavage ability extends to tetra-antennary heteroglycan chains. Further, it is known that it is 
possible to free sugar chains from glycoproteins without subjecting the protein to 
denaturization treatment. (S. Kadowaki, et al., Agric. BioL Chem., 54, 97 (1990)). Therefore, 
endo- 0 -N-acetylglucosaminidase derived from Mucor hiemalis can be said to be useful in 
the research of the functional and physiological role of sugar chains and proteins of 
glycoproteins. 

On the other hand, conversion of mannane type sugar chains derived from yeast into sugar 
chains in a form compatible with humans, is extremely significant industrially. As methods 
for this conversion, in vivo conversion through improvement of the yeast sugar chain 
biosynthetic system by genetic manipulation, and in vitro conversion using the trans- 
glycosylation reaction can be considered. For the purpose of sugar conversion, endo- 0 -N- 
acetylglucosaminidase is required to have as properties, 1) the substrate specificity, i.e an 
ability to cleave both mannane and complex types; and, 2) the ability to perform the trans- 
glycosylation reaction, which is the reverse reaction of the decomposition reaction. Therefore, 
endo- 0 -N-acetylglucosaminidase derived from Mucor hiemalis can be said to be an 
appropriate enzyme for the practice of said conversion. 



The present inventors, have proposed a sugar chain conversion technique using endo- jS -N- 
acetylglucosaminidase derived from Mucor hiemalis which can alter yeast sugar chains to 
human-compatible form. (Japanese Patent Application Laid-Open No: Hei 7-59587) 

To perform sugar chain conversions such as the above, an authentic enzyme preparation of 
high purity is required in great amounts. In this case, improvement of enzyme productivity by 
conventional breeding methods using mold cells has been considered. However, since 
conventional breeding methods are limited predominantly to the method of selecting such an 
enzyme from mutant strains obtained using ultraviolet light or mutagens, the isolation of 
stable mutants is difficult. Further, conventional breeding methods are often accompanied by 
unfavorable transformations. Further, since molds generally produce protease enzymes, they 
are unfavorable for production of an enzyme for the purpose of sugar conversion. Since, in 
order to overcome these problems it is necessary to proceed through a number of purification 
steps, the work is complicated and the yield is low. For example, culturing a microorganism 
belonging to the genus Mucor which is a type of filamentous mold, even if purification of the 
supernatant of this culture is performed, contamination with protease cannot be prevented and 
preparation in large amounts is difficult because of low enzyme productivity of the 
microorganism, and thus this method was of little practical value. 

Given the above, it is desired for the purpose of mass producing endo- 0 -N- 
acetylglucosaminidase, that the gene for said enzyme be obtained and produced through the 
use of genetic engineering. Further, if the gene can be obtained, it can be expected that an 
enzyme with improved heat resistance and pH resistance and increased reaction rate can be 
obtained using protein engineering techniques. However, there have been no reported 
attempts at gene cloning to date. 

Disclosure of Invention 

It is an object of the present invention to provide an endo- 0 -N-acetylglucosaminidase, an 



endo- 0 -N-acetylglucosaminidase gene, a recombinant vector which comprises said gene, an 
organism transformed with said vector, and a method of producing endo- 0 -N- 
acetylglucosaminidase. 

The present inventors, as a result of their intensive research directed to solving the above 
problem, based on partial amino acid sequence information of said endo- 0 -N- 
acetylglucosaminidase derived from Mucor hiemalis, have succeeded in obtaining the gene 
that encodes said enzyme from a cDNA library prepared from Mucor hiemalis which is a 
bacteria which produces said enzyme, have further succeeded in expressing this gene in yeast, 
and thereby completed the present invention. 

Thus, the present invention provides the recombinant protein of (a) or (b) below: 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO: 3; and, 

(b) a protein comprising an amino acid sequence derived from the amino acid 
sequence represented by SEQ ID NO: 3 by deletion, substitution, insertion, or 
addition of at least one amino acid and having the activity of endo- 0 -N- 
acet ylglucosaminidase . 

Further, the present invention provides a endo- 0 -N-acetylglucosaminidase gene which 
encodes the protein of (a) or (b) below, and a gene that hybridizes with said gene under 
stringent conditions, and which comprises DNA encoding a protein that has endo- 0 -N- 
acetylglucosaminidase activity. 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO: 3; 
and, 

(b) a protein comprising an amino acid sequence derived from the amino acid 
sequence represented by SEQ ID NO: 3 by deletion, substitution, insertion, or 
addition of at least one amino acid and having the activity of endo- 0 -N- 
acetylglucosaminidase. 

Further, the present invention provides a gene comprising the DNA of (c) or (d) below: 



(c) a DNA consisting of the nucleotide sequence represented by SEQ ED NO: 2; and, 

(d) a DNA which hybridizes under stringent conditions with a DNA consisting of the 
nucleotide sequence represented by SEQ ED NO: 2, and encodes a protein having 
the activity of endo- 0 -N-acetylglucosaminidase. 

Said gene includes a gene derived from a microorganism belonging to the genus Mucor. 

Further, the present invention provides a recombinant vector which comprises said gene. 

Further, the present invention provides a transformant which comprises said recombinant 
vector. 

Further, the present invention provides a method for producing an endo- 0 -N- 
acetylglucosaminidase, comprising culturing said transformant, and collecting endo- 0 -N- 
acetylglucosaminidase from the obtained culture product. 

Below, the present invention will be described in detail. 

The present invention is characterized by culturing endo- 0 -N-acetylglucosaminidase 
producing microorganisms, purifying the endo- 0 -N-acetylglucosaminidase from the obtained 
culture, designing degenerate probes from a partial amino acid sequence of said enzyme, 
cloning a gene encoding said enzyme by performing PCR, and further, cloning a gene 
encoding said enzyme from a cDNA library of the microorganism producing endo- 0 -N- 
acetylglucosaminidase. Further, the present invention is characterized by obtaining a 
recombinant vector by introducing the cloned gene into a vector, as well as by obtaining a 
transformant by introducing said recombinant vector into a host cell. Further, the present 
invention is characterized by producing endo- 0 -N-acetylglucosaminidase in large quantities 
by culturing said transformant. 

1 . Culturing of endo- 0 -N-acetylglucosaminidase producing microorganisms 



Microorganisms for producing endo- $ -N-acetylglucosaminidase include microorganisms 
belonging to the genus Mucor, preferably Mucor hiemalis, and more preferably the Mucor 
hiemalis deposited at the National Institute of Bioscience and Human-Technology, Agency of 
Industrial Science and Technology (1-1-3, Higashi, Tsukuba-shi, Ibaraki-ken, Japan) 
(Accession No. FERM BP-4991). 

The medium composition used in culturing of these strains may be of any kind typically used 
in the culture of microorganisms. 

Carbon sources include for example sugars such as glucose, sucrose 5 mannose, galactose, 
maltose, soluble starch and dextrin. Nitrogen sources include yeast extract, trypton, etc. As 
inorganic salts, apart from the inorganic salts contained in the above nitrogen sources, salts 
such as all varieties of sodium salts, potassium salts, calcium salts, magnesium salts and 
phosphate salts may be used. Vitamins may be added optionally. 

The culture medium is sterilized by a conventional method, and the strain is inoculated into 
the medium. Thereafter, a shake culture, or aeration-agitation culture is performed at 20-30°C, 
pH 5-7 for 2 to 4 days. 

In the present invention, culture is preferably performed at 25-30°C, pH 6, with galactose as 
the carbon source, yeast extract and trypton as the nitrogen sources, with concentrations of 
both carbon and nitrogen sources at 2-3% each, and the ratio of carbon source to nitrogen 
source at 2:3, for 3 to 4 days under good aeration conditions. Where the culture is performed 
under such conditions, the amount of enzyme produced is maximized, and in comparison to 
the known method (S. Kadowaki, et al., Agric. Biol Chem., 54, 97 (1990); glucose 0.5%, 
yeast extract 1 %, peptone 1 %), about 10-fold greater productivity can be achieved. 

Further, in the present invention, to ensure the aeration conditions when culturing the 
microorganisms, use of a jar fermenter is preferable. 



2. Purification of endo- jS -N-acetylglucosaminidase 

The endo- 0 -N-acetylglucosaminidase produced by the above bacterial strain is characterized 
by preservation of the following activity. In other words, it can be characterized by its activity 
to act on the asparagine-linked sugar chain in a glycoprotein, cleave the diacetyl-chitobiose 
portion within the sugar chain, and thereby liberate the sugar chain. 

Purification of endo- 0 -N-acetylglucosaminidase can be performed by the appropriate 
combination of known methods for separation and purification. Examples of such methods 
include methods using differences in solubility such as salt precipitation and solvent 
precipitation; methods using differences in molecular weight such as dialysis, ultrafiltration, 
gel filtration and SDS-polyacrylamide electrophoresis; methods using differences in 
electrostatic charge such as ion exchange chromatography; methods using differences in 
hydrophobicity such as hydrophobic chromatography and reverse phase chromatography, and 
methods using differences in isoelectric point such as isoelectric focusing. 

In the present invention, by adopting a method of culturing which is an improvement over the 
above described known method (S. Kadowaki, et al., Agric BioL Chem., 54, 97 (1990)), and 
subjecting to many purification steps, endo- /3 -N-acetylglucosaminidase can be efficiently 
purified, and it is possible to obtain a sufficient amount of the protein to determine the amino 
acid sequence necessary to obtain the gene. The obtained enzyme, as a result of purification 
and as a result of the analysis of the gene to be described below, consists of a single gene 
product with a molecular weight of approximately 85,000, and after post-translational partial 
digestion of the gene product, it was found to be composed of 2 or more subunits including 
peptides with molecular weights of approximately 60,000 and 14,000. 

3. Cloning of the novel endo-/3 -N-acetylglucosaminidase gene 

It was clear that the endo- & -N-acetylglucosaminidase obtained from Mucor hiemalis was 



composed of at least 2 or more peptides. 



Generally, when isolating a gene encoding a specific protein, a partial amino acid sequence of 
the protein is determined, and it is possible to isolate the desired gene from a gene library with 
a mixture of oligonucleotides consisting of degenerative codons as a probe. Further, after 
obtaining a fragment by PCR such as in the present invention, it is possible to isolate the 
desired gene from a gene library using this fragment as a probe. 

However, since endo- 0 -N-acetylglucosaminidase is a hetero-oligomeric molecule consisting 
of 2 or more subunits, there is the possibility that each subunit is encoded independently by 
their separate differing genes. Further^ even if endo- 0 -N-acetylglucosaminidase is derived 
from one gene, its structure, for example, the positional relationship between the regions 
encoding the two subunits within the structural gene was unclear. 

Thus, the present inventors determined partial amino acid sequences for 2 subunits, and then 
after obtaining partial fragments by PCR, used said fragments as probes to achieve cDNA 
cloning and by analyzing the gene structure, clarified that the same gene coded for these 2 
subunits. That is, it was clarified that the novel endo- 0 -N-acetylglucosaminidase is produced 
as one polypeptide from the gene encoding this enzyme, and is processed by partial 
decomposition into 2 or more subunits. 

The gene of the present invention is cloned by, for example, the following method. 
(1) Cloning of the endo- 0 -N-acetylglucosaminidase gene 

In the present invention, an example of a DNA fragment comprising a gene encoding a novel 
endo- 0 -N-acetylglucosaminidase is the DNA fragment indicated by the restriction enzyme 
map shown in Fig. 2. This fragment can be isolated using genetic engineering methods from a 
cDNA library with an mRNA template prepared from a microorganism belonging to the 
genus Mucor ^ preferably from a strain of Mucor hiemalis, and more preferably from the strain 
of Mucor hiemalis deposited under accession number FERM BP-4991 at the National 
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Institute of Bioscience and Human-Technology, (See the method described in, for example, 
Molecular Cloning: A Laboratory Manual (Sambrook, Maniatis et al, Cold Spring Harbour 
Laboratory Press (1989)) 

Preparation of mRNA can be performed according to a typical method. For example, after 
culturing the mRNA source, Mucor hiemalis, total RNA is obtained from the cultured cell 
with a kit available on the market (ISOGEN (Nippon Gene Company)), and then purified with 
a purification kit available on the market (mRNA Purification Kit (Pharmacia Biotech)). In 
the preparation of mRNA, it is preferable to keep the culturing time short to control the 
decomposition of mRNA. 

With the thus obtained mRNA as a template, a single-strand cDNA is synthesized using an 
oligo dT primer and a reverse transcriptase enzyme. Thereupon, a duplex cDNA is 
synthesized from said single stranded cDNA. A recombinant vector is then constructed by 
incorporating the duplex cDNA into a suitable cloning vector. A cDNA library can be 
obtained by transforming E.coli using the obtained recombinant vector, and selecting 
transformants using tetracycline resistance and ampicillin resistance as an indicator. 

Here, transformation of E.coli is performed according to a method such as the Hanahan 
method [Hanahan, D. : J. MoL Biol 166 : 557-580 (1983)]. When a plasmid is to be used as a 
vector, it is necessary to include a gene for resistance to a drug such as tetracycline or 
ampicillin. Further, a cloning vector other than a plasmid, for example, X phage or the like 
can be used. 

A strain having the desired DNA is selected (screened) from the thus obtained transformant. 
Screening methods include, for example, a method of synthesizing sense and antisense 
primers corresponding to a partial amino acid sequence of endo- 0 -N-acetylglucosaminidase, 
and using this to perform a polymerase chain reaction (PCR). Template DNA may include for 
example, genomic DNA or cDNA synthesized by reverse transcription from the above- 
mentioned mRNA. As primers, in respect of the sense chain, for example, 5'- 



C ARTTRC ARCCNG AYGAYA A-3 ' (SEQ ID NO: 5) synthesized on the basis of amino acid 
sequence: PSLQLQPDDK (SEQ ID NO: 4) and 5 ' -CCH ACNG AYC ARA AYAT YA A-3 ' 
(SEQ ID NO: 7) synthesized on the basis of amino acid sequence: SYRNPEIYPTDQNIK 
(SEQ ID NO: 6), may be used. Further, in respect of the antisense chain 3*- 
GGDTGNCTRGT YTTRTARTT-5 * (SEQ ID NO: 8) synthesized on the basis of amino acid 
sequence: SYRNPEIYPTDQNIK (SEQ ID NO: 6) and 3 '-TTYCCDGTYGCD A ARTTRGT- 
5' (SEQ ID NO: 10) synthesized on the basis of amino acid sequence: 
GQRFNHRESHDVETEI (SEQ ID NO: 9), may be used. However, the present invention is 
not limited to these primers. 

Thus, the obtained DNA amplification fragment is labeled with for example 32 P 3 35 S or biotin 
and taken as a probe, and is then made to hybridize with the cDNA library of the transformant, 
which library has been denatured and immobilized onto a nitrocellulose filter. Screening can 
then be performed by searching the obtained positive strains: 

(2) Determination of the nucleotide sequence 

Determination of the nucleotide sequence of the obtained clone is performed. Determination 
of the nucleotide sequence may be performed by known methods such as the Maxam-Gilbert 
chemical modification method or dideoxy method, though typically determination of the 
sequence is performed using an automated nucleotide sequencer. (For example, PERKIN- 
ELMER 377A DNA Sequencer) 

The full length sequence of the endo- 0 -N-acetylglucosaminidase gene is indicated by SEQ 
ID NO: 1. Therein, a preferable example of the gene of the present invention, is the nucleotide 
sequence from position 71 to position 2305 (SEQ ID NO: 2) of the nucleotide sequence 
indicated by SEQ ID NO: 1. Further, the gene of the present invention includes not only a 
sequence which encodes the amino acid sequence represented by SEQ ID NO: 3 or the below 
described amino acid sequence having an equivalent sequence, but also encompasses 
degenerate isomers encoding identical polypeptides which differ in respect of degenerate 
codons only. 
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A nucleotide sequence encoding an amino acid sequence having an equivalent sequence can 
be prepared using a method such as site-directed mutagenesis. That is, mutations may be 
introduced by known methods such as the Kunkel method or Gapped duplex method, or other 
methods equivalent thereto, using for example a mutation introduction kit that employs site- 
directed mutagenesis (e.g. Mutant-K (Takara), Mutant-G (Takara)) etc., or using Takara's LA 
PCR in vitro Mutagenesis series kit. 

Further, an endo- 0 -N-acetylglucosaminidase gene includes not only DNA consisting of the 
nucleotide sequence represented by SEQ ID NO: 1 or SEQ ID NO: 2, but also a DNA which 
hybridizes under stringent conditions with said DNA- and encodes a protein having the 
activity of endo- 0 -N-acetylglucosaminidase. Stringent conditions refer to, for example, the 
conditions of a sodium concentration of 50-300mM, preferably 150mM and a temperature of 
50-68°C, preferably 65°C. 

Once the nucleotide sequence of the endo- 0 -N-acetylglucosaminidase gene (SEQ ID NO: 1) 
is identified, since the nucleotide sequence of the DNA fragment having the sequence from 
position 7 1 to 2305 of the said nucleotide sequence (open reading frame) is determined (SEQ 
ID NO: 2), it is possible to obtain the endo- 0 -N-acetylglucosaminidase gene by chemical 
synthesis; by PCR with genomic DNA as a template and the 5' and 3' terminal sequences of 
the open reading frame (SEQ ID NO: 2) (e.g. 5 *- ATGCCTTC ACTTC A ATTGC A ACC-3 * 
(SEQ ID NO: 11) and 5 ' -CTAGTTTA ATGAC A A ATCTATGC-3 * (SEQ ID NO: 12) as 
primers; or, by hybridization with a DNA fragment having the nucleotide sequence of an 
endo- 0 -N-acetylglucosaminidase gene as a probe. 

The plasmid pZL-Endo (See Example 3 below) which comprises the gene of the present 
invention was introduced into E.coli DH10B (Title: DHBpZL-Endo) and deposited at 
National Institute of Bioscience and Human-Technology (1-1-3, Higashi, Tsukuba-shi, 
Ibaraki-ken, Japan) on April 28, 1998 under accession number FERM BP-6335. 
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In the present invention, the amino acid sequence represented by SEQ ID NO: 3 or a 
polypeptide comprising an equivalent sequence, is provided as a preferable example of a 
recombinant novel endo- 0 -N-acetylglucosaminidase. Here, "equivalent sequence" refers to a 
sequence which comprises the amino acid sequence represented by SEQ ID NO: 3 and in 
which at least one amino acid is inserted, substituted, deleted or added to either end, and 
which retains said novel endo- )S -N-acetylglucosaminidase activity. Retention of novel endo- 
/3 -N-acetylglucosaminidase activity in this equivalent sequence means that the sequence 
maintains activity sufficient for it to be used in an almost identical manner under identical 
conditions to the polypeptide having the full sequence represented by SEQ ID NO: 3 in actual 
forms of use which exploit this activity. It is clear that it is possible for a person skilled in the 
art to select and produce with no particular difficulty such an equivalent sequence, referring to 
the sequence represented by SEQ ID NO: 3. For example, within the amino acid sequence 
represented by SEQ ID NO: 3, at least 1, preferably 1 to 10, more preferably 1 to 5 amino 
acids may be deleted; at least 1, preferably 1 to 10, more preferably 1 to 5 amino acids may be 
added or inserted; or, at least 1, preferably 1 to 10, more preferably 1 to 5 amino acids may be 
substituted. Accordingly, the protein of the present invention includes a polypeptide having 
the amino acid sequence from position 2 to 744 of the amino acid sequence represented by 
SEQ ID NO: 3 in the Sequence Listing (one in which the methoinine at position 1 of the 
amino acid sequence represented by SEQ ID NO: 3 has been deleted.) 

Here, through the partial amino acid sequence analysis of the present invention and gene 
structure analysis, it became clear that 2 or more natural type subunits were produced by 
cleavage of a precursor polypeptide on the C terminal side of at least the histidine at position 
510 and the asparagine at position 627 in the amino acid sequence represented by SEQ ID 
NO: 3. 

2. Construction of a recombinant vector and transformant 

The present invention provides a DNA molecule comprising the gene of the present invention, 
in particular, an expression vector. This DNA molecule can be obtained by incorporating a 
DNA fragment encoding the novel endo- 0 -N-acetylglucosaminidase according to the present 
invention into a vector molecule. Accordingly, if transformation of a host cell is performed 
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with a DNA molecule, particularly in the form of an expression vector, which includes a DNA 
fragment encoding the novel endo- # -N-acetylglucosaminidase of the present invention in a 
form such that it is replicable within the host cell and the said gene is expressible, it is 
possible to cause production of the novel endo- 0 -N-acetylglucosaminidase of the present 
invention in the host cell. 

The DNA molecule according to this invention can be constructed based on the method 
described in the above-referenced Molecular Cloning: A Laboratory Manual (supra). 

(1) Construction of a recombinant vector 

The vector to be used in the present invention may be appropriately selected from a virus, 
plasmid, cosmid vector or the like in consideration of the type of host cell to be used. 

For example, where the host cell is E.coli, bacteriophages of the X phage line, plasmids of 
the pBR line (pBR322, pBR325 etc.) and pUC line (pUC118, pUC119 etc); where the host 
cell is Bacillus subtilis, plasmids of the pUB line (pUBHO etc); and where the host cell is 
yeast, vectors of the YEp and YCp lines (e.g. YEpl3, YEp24, YCp50 etc.), or pG-3-Not used 
in the Examples below, may be used. Further, animal viruses such as a retrovirus or vaccinia 
virus, or insect virus vectors such baculovirus can be used. 

To introduce the gene of the present invention into the vector, methods for ligating the gene to 
the vector such as first cleaving the purified DNA with a suitable restriction enzyme, and then 
inserting the gene at suitable restriction sites or multicloning sites of the vector DNA, are 
employed. 

It is necessary that the gene of the present invention be incorporated into the vector such that 
the function of this gene is exhibited. Thus, it is preferable that the vector of the present 
invention includes a selective marker. Drug-resistant markers and auxotrophic markers can be 
used as selective markers. 
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Further, it is preferable that the DNA molecule to be used as the expression vector of the 
present invention has DNA sequences necessary for the expression of the novel endo- 0 -N- 
acetylglucosaminidase gene such as transcription regulating signals and translation regulating 
signals such as, for example, a promoter, transcription initiation signal, ribosome binding site, 
translation stop signal and a transcription termination signal. 

(2) Construction of the transformant 

The transformant of the present invention can be obtained by introducing the recombinant 
vector of the present invention into a host in a manner allowing expression of the subject gene. 
There is no particular limitation on the host that may be used as long as it allows expression 
of the gene of the present invention. Examples include bacteria of the genus Escherichia such 
as Escherichia coli, of the genus Bacillus such as Bacillus subtilis, or of the genus 
Pseudomonas such as Pseodomonas putida etc., and yeasts such as Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Candida boidinii and Pichia pastoris. 

As host cells, apart from E.coli, B. subtilis, and yeast, animal cells such as COS cells and 
CHO cells etc., and insect cells such as Sf9 and Sf21 etc. may be used. 

Where a bacterium such as E.coli is used as a host, the recombinant vector of the present 
invention preferably is autonomously replicable within the bacterium and comprises a 
promoter, ribosome binding site, the gene of the present invention, and a transcriptional 
termination sequence. A gene which controls the promoter may also be included. 

Examples of E. coli include Escherichia coli K12, DH1, DH5a 9 JM109 etc. Examples of 
Bacillus subtilis include Bacillus subtilis MI 114, 207-21 etc. It is known that there exist 
strains of Bacillus subtilis that secrete proteins out of the microorganism body. There are also 
known strains that secrete hardly any protease. The use of such strains as hosts is preferable. 

As a promoter, a promoter within the inserted fragment that is able to function even in the 
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host, may of course be used. Examples of promoters in E.coli include lactose operon (lac) and 
tryptophan operon (trp), etc. 

As a method for introducing the recombinant vector into bacteria, any method for introducing 
DNA into bacteria may be employed and there is no particular limitation thereon. Examples 
include a method using calcium ions [Cohen, S.N. et al.: Proc. Natl. Acad. Sci., USA, 
69:2110-2114(1972)], and electroporation. 

Where yeast is to be the host, Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Candida utilis, etc., may be used. In this case, there is no particular limitation on the promoter 
that may be used as long as it can express in yeast. For example, promoters such as alcohol 
dehydrogenase (ADH), acidic phosphatase (PHO), galactose gene (GAL),and glyceraldehyde- 
3-phosphate dehydrogenase gene (GAPDH) promoters, heat shock protein promoter, MF a 1 
promoter, PGK promoter, GAP promoter and AOX1 promoter may preferably be used. 

As a method for introducing the recombinant vector into yeast, any method for introducing 
DNA into yeast may be employed and there is no particular limitation thereon. Examples 
include electroporation [Becker, D.M. et al.: Methods. EnzymoL, 194: 182-187 (1990)], 
spheroplast method [Hinnen, A. et al.: Proc. NatL Acad. Sci., USA, 75: 1929-1933 (1978)], 
and lithium acetate method [Itoh, H. : J. BacterioL, 153:163-168 (1983)]. 

Where an animal cell is to be the host, monkey cell COS-7, Vero, Chinese hamster ovary cell 
(CHO cell), mouse L cell, rat GH3 and human FL cell, etc. may be used. As a promoter, SR ct 
promoter, SV40 promoter, LTR promoter, CMV promoter or the like can be used. Also, the 
initial promoter of human cytomegalovirus or the like may be used. 

Examples of a method for introducing the recombinant vector into an animal cell include 
electroporation, calcium phosphate method and lipofection method. 

Where an insect cell is to be the host, Sf9 cell, Sf21 cell and the like may be used. 
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Examples of a method for introducing the recombinant vector into an insect cell include 
calcium phosphate method, lipofection, and electroporation. 

4. Production of the protein of the present invention 

A protein of the present invention has an amino acid sequence encoded by a gene of the 
present invention, or has said amino acid sequence into which said modification of at least 1 
amino acid has been introduced, and has the activity of endo- £ -N-acetylglucosaminidase. 

The protein of the present invention can be obtained by culturing the above-mentioned 
transformant and collecting the protein from this culture product. "Culture product" refers to 
either the culture supernatant, the cultured cells or microbial cells, or disrupted cells or 
microbial cells. 

Either a natural medium or synthetic medium may be used as a medium for culturing 
transformants obtained with microorganisms such as E. coli or yeast as hosts, as long as it 
contains a carbon source, a nitrogen source, inorganic salts and the like, which are able to be 
assimilated by the microorganism and it may be used to efficiently perform a culture of the 
transformant. 

As carbon sources, carbohydrates such as glucose, fructose, sucrose and starch, organic acids 
such as acetic acid and propionic acid, and alcohols such as ethanol and propanol are used. 

As nitrogen sources, inorganic acids such as ammonia, ammonium chloride, ammonium 
sulfate, ammonium phosphate, organic acids such as ammonium acetate, or other nitrogen 
containing compounds, as well as peptone, meat extract, corn steep liquor or the like are used. 

As inorganic matter, potassium (I) phosphate, potassium (II) phosphate, magnesium 
phosphate, magnesium sulfate, sodium chloride, iron (I) sulfate, manganese sulfate, copper 
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sulfate and calcium carbonate, etc are used. 

Culturing is typically performed under aerobic conditions such as a shake culture or aeration- 
agitation culture, at 37°C at 12 to 72 hours. During culturing, pH is maintained at 4-7.5. 
Regulation of pH is performed using inorganic or organic acid or alkali solutions and the like. 

During culturing, antibiotics such as ampicillin and tetracycline may be added to the medium 
as required. 

When the microorganism transformed with an expression vector containing an inducible 
promoter is cultured, an inducer may be added to the culture as required. For .example > when 
culturing a microorganism transformed with an expression vector which uses a Lac promoter, 
isopropyl- 0 -D-thiogalactoside (IPTG) and the like may be added to the culture, and when 
culturing a microorganism transformed with an expression vector using a trp promoter, indole 
acetic acid (IAA) may be added. 

As culture media for culturing a transformant obtained from an animal host cell, the generally 
used RPMI1640 medium, DMEM medium or these medium to which fetal calf serum and the 
like have been added, are used. 

Culturing is typically performed over 2 to 10 days at 37°C in the presence of 5% C0 2 . 
Antibiotics such as kanamycin and penicillin and the like may be added during culturing as 
required. 

After culturing, when the protein of the present invention has been produced within the 
microorganisms or cells, the protein of the present invention is extracted by disrupting the 
microorganisms or cells. When the protein of the present invention has been produced outside 
the microorganisms or cells, the culture fluid may be used as is, or the microorganisms or 
cells removed by centrifugal isolation, etc. 
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Purification of the recombinant novel endo- 0 -N-acetylglucosaminidase, is performed by the 
suitable combination of known separation and purification methods. Examples of such 
methods include methods using differences in solubility such as salt precipitation, and solvent 
precipitation methods; methods using differences in molecular weight such as dialysis, 
ultrafiltration, gel filtration and SDS-polyacryl electrophoresis; methods using differences in 
electrostatic charge (valence) such as ion exchange chromatography; methods using 
differences in hydrophobicity such as hydrophobic chromatography and reverse phase 
chromatography; and, methods using differences in isoelectric point such as isoelectric 
focusing. 

In the present invention, as indicated in the Examples below, when this gene was made to 
express under the control of GAPDH promoter, in a Saccharomyces cerevisiae host, high 
enzyme activity was confirmed within the cell extract. This indicated that it was possible to 
produce active novel endo- 0 -N-acetylglucosaminidase in large quantities through the 
expression of the gene of the present invention in the recombinant. 



Brief Description of Drawings 



Figure 1 is an electrophoresis photograph indicating results for purified endo- 0 -N- 
acetylglucosaminidase. 

Figure 2 is a restriction enzyme map of pZL-Endo which comprises the full length sequence 
of a novel endo- 0 -N-acetylglucosaminidase gene. 

Figure 3 indicates the entire nucleotide sequence of the fragment inserted at the Sal I-Not I 
sites of pZL-Endo which comprises the full length sequence of a novel endo- 0 -N- 
acetylglucosaminidase gene. 

Figure 4 indicates the entire nucleotide sequence of the fragment inserted at the Sal I-Not I 
sites of pZL-Endo which comprises the full length sequence of a novel endo- 0 -N- 
acetylglucosaminidase gene. (Continuation of Figure 3) 

Figure 5 shows the deduced amino acid sequence from novel endo- 0 -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
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sequence. 

Figure 6 shows the deduced amino acid sequence from novel endo- 0 -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
sequence. (Continuation of Figure 5) 

Figure 7 shows the deduced amino acid sequence from novel endo- 0 -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
sequence. (Continuation of Figure 6) 



Figure 8 shows the structure of expression vector pGEndo-SC ,which comprises a novel endo- 
ws -N-acetylglucosaminidase gene for use in Sdccharomyces cerevisiae. 

Figure 9 is a chromatography photograph which indicates the expression of novel endo- 0 -N- 
acetylglucosarninidase in yeast into which said enzyme gene has been introduced. 



Best Mode for Carrying Out the Invention 

Below, the present invention will be described more specifically using examples. It is 
contemplated, however, that the technical scope of the present invention is not to be limited to 
these examples. Unless otherwise indicated herein the procedures were performed in 
accordance with the methods as are described in Molecular Cloning: A Laboratory Manual 
(Sambrook, Maniatis et al, Cold Spring Harbour Laboratory Press (1989)). 

Example 1 : Measurement of Enzyme Activity 

Measurement of the activity of endo- 0 -N-acetylglucosaminidase was performed 
fundamentally in accordance with the method indicated in S. Kadowaki, et al., Agric. Biol. 
Chem., 54, 97 (1990). That is, the reaction was performed using dansylated human asialo- 
transferrin glycopeptide (DNS-GP) as a substrate in a potassium phosphate buffer (pH6.0) at 
37°C and activity measured by thin layer chromatography (TLC) or by HPLC under the 
following conditions. 
Analysis conditions for TLC: 

Development phase: HPTLC silica gel 60 (Merck) 

Solvent: butanol: acetic acid: water =2:1:1 
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Detection: detection by fluorescence method 
Analysis conditions for HPLC: 

Column: TSK-gel ODS80TM (TOSO) 

Solvent: 25mM sodium borate buffer (pH 7.5) +11 % acetonitrile 
Column temperature: 50°C 
Flow rate: O.Sml/minute 
Detector: fluorescence detector 

Activity was defined such that 1 unit is equivalent to the amount of enzyme required to 
produce 1 fi mol of dansylated asparagyl acetylglucosamine in one minute under the above 
described HPLC measurement conditions- 
Example 2: Culturing Mucor hiemalis 

A medium (galactose 2%, yeast extract 3%) 100ml was placed in a 500ml Sakaguchi flask, 
and inoculated with a third to a fifth of a slant of Mucor heimalis spore, and culture was 
performed at 28°C for 2 days. The microbial cells separated from the culture by suction 
filtration were used in the preparation of mRNA. 

Regarding preparation of the enzyme, the above culture, following cultivation, was 
transferred to 2 liter of the medium in a 3 liter jar fermenter, and culturing was performed for 
four days under conditions of 28°C, rotation speed of 300-400rpm, and aeration volume of 2 
liters per minute. 

Example 3: Purification of novel endo-# -N-acetylglucosaminidase 

Four liters of the culture (from two batches of cultures performed in 3-liter jar fermenter) 
obtained in Example 2 was subjected to suction filtration to separate the cells, and 
concentrated to 200ml using ultrafiltration. This crude enzyme solution was subjected to ion 
exchange chromatography (Pharmacia Q Sepharose FF, 500ml) equilibrated with lOmM 
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potassium phosphate buffer containing 5mM EDTA. The column was washed with the same 
buffer solution followed by elution of endo- 0 -N-acetylglucosaminidase with a linear 
gradient of 900ml 0M-0.3M NaCl. Reagents were added to the active fraction such that the 
final concentrations of 1M ammonium sulfate, 50mM potassium phosphate containing 5mM 
EDTA were attained. The product was subjected to hydrophobic chromatography (TOSO 
Phenyl-TOYOPEARL 650S 200ml) equilibrated with the same buffer. The column was 
washed with this same buffer and then endo- 0 -N-acetylglucosaminidase was eluted with a 
linear gradient of 1M-0M ammonium sulfate containing ImM EDTA (600ml). 

The obtained eluate was concentrated to 5ml with ultrafiltration membrane (molecular weight 
cut-off 13000) then washed and desalted with lOmM potassium phosphate buffer containing 
ImM EDTA and 0.15M NaCl (pH7.0). Next, the active fraction was applied to a gel filtration 
chromatography column (Pharmacia Sephacryl S300) equilibrated with the same buffer 
solution, and endo- 0 -N-acetylglucosaminidase was eluted with the same buffer solution. 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with lOmM potassium phosphate containing ImM EDTA 
(pH7.0). Next, the active fraction was subjected to hydroxy apatite chromatography (TOSO 
TSK-gel HA 1000) equilibrated with the same buffer. The column was washed with this same 
buffer and then endo- 0 -N-acetylglucosaminidase was eluted with a linear gradient of 0M- 
0.3M ammonium sulfate containing ImM of EDTA (pH7.0)(30ml). 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with a 25mM bis-tris buffer solution adjusted to pH 7.1 with 
iminodiacetic acid. The active fraction was subjected to isoelectric point chromatography 
(Pharmacia, MonoP) with equilibrated with the same buffer solution. The column was washed 
with this same buffer and then endo- 0 -N-acetylglucosaminidase was eluted with 50ml of 
10% Polybuffer 74 (Pharmacia) adjusted to pH 3.9 with iminodiacetic acid. 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 



13000) then washed and desalted with lOmM potassium phosphate buffer containing ImM 
EDTA (pH7.0). Next, the active fraction was subjected to ion exchange chromatography 
(Pharmacia, MonoQ). The column was washed with this same buffer and then endo- & -N- 
acetylglucosaminidase was eluted with a linear gradient of 0M-0.3M NaCl (30ml). 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with 50mM potassium phosphate buffer containing ImM 
EDTA (pH7.0), to obtain an enzyme sample. It should be noted that each column 
chromatography was performed using FPLC (Pharmacia). 

The mass of the protein was measured using Protein Assay Kit manufactured by BioRad. or 
by absorbance (280nm). The molecular weight and isoelectric point of the protein was 
measured by SDS-PAGE (15-25% gradient), gel filtration chromatography, IEF-PAGE and 
the like. 

From the activity of each of the fractions in two-dimensional electrophoresis by Native- 
SDSPAGE and IEF-SDSPAGE, and in the above chromatography, and from the results of 
SDS-PAGE analysis, bands of at least 60kDa (denoted p60) and 14kDa (denoted pl4) on 
SDS-PAGE were detected (Fig. 1). 

Example 4: Determination of a partial amino acid sequence of novel endo- 0 -N- 
acetylglucosaminidase 

Partial amino acid sequence analysis was performed according to the Iwamatsu method 
(Seikagaku (Biochemistry) 63, 139-143 (1991)). The purified enzyme was suspended in an 
electrophoresis buffer solution (10% glycerol, 2.5% SDS, 2% 2-mercaptoethanol, 62mM Tris 
HC1 buffer (pH6.8)), and was subjected to SDS polyacrylamide electrophoresis. After 
electrophoresis, said enzyme was transferred from the gel to 10cm by 7cm PVDF membrane 
((ProBlot) Applied Biosystems) by electroblotting. Using ZARUTO Blot Type lis 
(Zarutorius) as an electroblotting device, electroblotting was preformed at 160mA for 1 hour. 
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After transfer, the portion of the membrane onto which the said enzyme was transferred was 
cut out and part of this portion was directly analyzed with a gas phase protein sequencer 
whereby the N-terminal amino acid sequence was determined. Further, the remaining 
membrane was soaked in 300 (A 1 of a reducing buffer solution (8M guanidine-HCl, 0.5M Tris- 
HC1 buffer (pH8.5), 0.3%EDTA, 2% acetonitrile), lmg of ditiothreitol (DTT) was added, and 
reduction was performed for about 1 hour at 25°C in the presence of argon. To this, 3.0mg of 
monoiodoacetic acid dissolved in 10 Ml of 0.5N sodium hydroxide solution was added and 
was stirred for 20 minutes under darkened conditions. PVDF membrane was taken out, and 
after thorough enough washing with 2% acetonitrile, it was soaked in lOOmM acetic acid 
containing 0.5% nol_win.vln_vn:olidone-40- and allowed to stand for 30 minutes. Afterward. 

4J ./ i " «/ ' •» " . ._- _ _ — . , 

the PVDF membrane was washed thoroughly with water. A 1mm square was cut from the 
membrane and soaked with a digestion buffer (8% acetonitrile, 90mM Tris-HCl buffer pH 
9.0) and 1 pmol of Acromobacter protease I (Wako Pure Chemicals Industries) was added 
thereto. The enzyme was then digested for 15 hours at room temperature. The digestion 
product was separated by reversed phase high performance liquid chromatography on C18 
column (Wakosil AR II C18 300 A 2.0 x 150mm(Wako Pure Chemicals Industries)), and 
seven peptide fragments were obtained in respect of each subunit. 

As elution solvents for the peptides, solvent A (0.05% trifluoroacetic acid) and solvent B (2- 
propanol/acetonitrile solution 7:3, containing 0.02% trifluoroacetic acid) were used, and the 
elution was performed for 40 minutes with a linear concentration gradient from 2 to 50% of 
solvent B, at a rate of 0.25mI7min. 

Amino acid sequences of fragmented peptides obtained from the novel endo- 0 -N- 
acetylglucosaminidase candidate protein were analyzed. Fragments derived from p60 and 
fragments derived from pi 4, were denoted as p60-AP and pl4-AP, respectively. Sequencing 
of the obtained fragmented peptides was carried out using a gas phase protein sequencer 
PPSO-10 (Shimadzu) by the Edman degradation method in accordance with the manual. 
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The obtained partial amino acid sequences are indicated in Table 1 . 



Table 1 - Partial amino acid sequence of endo- /9 -N-acetylglucosaminidase candidate protein 
p60 

p60-AP-5 PSLQLQPDDK (SEQ ID NO: 

17) 

p60-AP-6 (K) SYRNPEIYPtDQNIK (SEQ ID NO: 18) 

p60-AP-8 (K) FNVSSVALQPRVK (SEQ ID NO: 19) 

p60-AP-9 (K) MDRLFLCGgK (SEQ ID NO: 

20) 

S 

p60-AP-ll (K) GQRFNHREShDVETEI (SEQ ID NO: 

21) 

mal p Hit 

pl4 

pl4-AP-l (K) EG YISS SGSIDLSLN (SEQ ID NO: 22) 



In the amino acid sequence described in Table 1, amino acids indicated by lower case letters 
of the alphabet are undetermined amino acids within the amino acid sequence. 

Since Acromobacter protease I, used on the partial amino acid sequence, specifically cleaves 
on the carboxyl side of a lysine residue, K (lysine) has been written in brackets on the N- 
terminus side of the sequences below. 

Since the p60-AP-5 was found to be the N-terminus amino acid sequence, K (Lysinse) in 
brackets was removed. 

In respect of the Acromobacter protease I digestion products of p60 and p!4, molecular 
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weight analysis was also performed with on-line mass analyzer (PE Sciex API-HI), combined 
with reverse phase high performance liquid chromatography (Hitachi L6200) on C-18 column 
(GL Science Inertsil ODS-3 0.5 x 40mm). The results of analysis are shown in Table 2. 



Table 2 Results of /Lys-C digest product on the 60kDa peptide (p60) and the 14kDa 



peptide (pi 4) by LC/MS analysis 



E>60 


Measured 
Value 


Ideal 
Value 


Error 
Margin 


Corresponding 
Sequence 


SEQ 
ID 

NO: 


AP-1: 
AP-2: 

AP-3: 
AP-4: 
AP-5: 

AP-7: 

AP-8: 
AP-9: 

AP-10: 
AP-11: 


950.50 
1160.50 

733.25 
1838.50 
1141.00 
1157.50 

1 11 A 7^ 

701.50 
1544.50 
1621.00 
1444.75 

945.75 
2655.00 
2206.75 
2335.00 


950.47 
1160.56 

nil a i 
733.41 

1837.91 

i 140:59 

1556.75 

1 11 A QA 

701.39 
1543.79 
1620.73 
1444.83 

945.58 
2655.33 
2206.11 
2335.16 


+0.03 
-0.06 

A 1 A 
-U.IO 

+0.59 
+0.41 
+0.75 

yj. i Z7 

+0.11 
+0.71 
+0.27 
-0.08 
+0.17 
-0.33 
+0.64 
-0.16 


(K)NIQGNNYK 
(K)YSDYPPPPPK 

(K)SYRNPEIYPTDQNIK 

( )PSLQLQPDDK....p60 N-terminus 

(K)NTDGIFLNYWWK 

(K)LTVAB'H p60 C-terminus 
(K)PQLLLTHDMAGGYK 
(K) SMNELRDWTPDEK 
(K)FNVSSVALQPRVK 
(K)LAPVSFALK 

(K)GQRFNHRESHDVETEISIPLYK 
(K)ITS SLDB*DHG AFLGGTSLIIK 
(K)NELFFKNTDGIFLNYWWK 


23 
24 

Z3 

18 
17 
26 
97 
28 
29 
30 
19 
31 
32 
33 
34 


pl4 


Measured 
Value 


Ideal 
Value 


Error 
Margin 


Corresponding 
Sequence 


SEQ 

ID 

NO: 


AP-1: 
AP-2: 
AP-3: 


888.75 
1392.50 
1541.50 
1608.50 


888.45 
1392.76 
1541.73 
1608.84 


+0.30 
-0.26 
-0.23 
-0.34 


(K)IVIEAVNK 

( )SSRIIQDLFWK pl4 N-terminus 

(K)EGYISSSGSIDLSLN..pl4 C-terminus 
(K)TDSSRHQDLFWK 


35 
36 
22 
37 



* B represents cysteine, carboxymethyl. 



In Table 2, the fragment having a measured value of 701.50 in respect of mass (M+H+) is 
p60-AP-7, and the fragment having a measured value of 1541.50 is nearly identical to the 
molecular weight of p 14- AP-3, and it was found that the amino acid at this C-terminus was 
not K (lysine). Regarding the fragments digested with Acromobacter protease I, since, due to 
the substrate specificity of this enzyme, fragments other than C-terminus fragment of the 
subunit itself, will have a C-terminus amino acid residue that is lysine (K), it is clear that 
these fragmented peptides were the C-terminus fragments of the subunits p60 and pi 4. 
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A homology search on the determined p60 and pl4 partial amino acid sequences using the 
protein database BLASTP indicated that the obtained sequences were novel. From the above 
results, gene cloning of p60 and pl4 was performed selecting the p60 and pl4 as endo- 0 -N- 
acetylglucosaminidase candidates. 



Example 5: Construction of a Mucor hiemalis strain cDNA library. 

Firstly, total RNA was extracted from 5g of the microorganisms obtained in Example 2 using 
ISOGEN (Nippon Gene, Inc.). From the extracted total RNA, mRNA was purified using an 
mRNA Purification Kit (Pharmacia Biotech). cDNA was synthesized from the mRNA using a 
Superscript™ Lambda System for cDNA Synthesis and X Cloning Kit (GIBCO BRL), and 
then ligated to a Sal I adapter, and finally linked (or ligated) to A ZipLox™Sal I-Not I Arms 
(GIBCO BRL). Packaging was performed using Gigapack HI Gold Packaging Extract 
(Stratagene), for infection of E.coli Y1090 strain thereby constructing the cDNA library. 



Example 6: Cloning of novel endo-/3 -N-acetylglucosaminidase cDNA. 



PCR primers were designed based on the partial amino acid sequences p60-AP-5, p60-AP-6, 
p60-AP-ll, These sequences are indicated below. The symbols used herein are all based on 
IUPAC-IUB. 



p60-AP-5 

p60-AP-5F 5' CARTTRCARCCNGAYGAYAA 3' (sense primer) (SEQ ED NO: 5) 
p60-AP-6 

p60-AP-6F 5' CCHACNGAYCARAAYATYAA 3' (sense primer) (SEQ ID NO: 7) 
p60-AP-6R 3' GGDTGNCTRGTYTTRTARTT 5' (antisense primer) (SEQ ID NO: 8) 
p60-AP-ll 

p60-AP-HR 3' TTYCCDGTYGCDAARTTRGT 5' (antisense primer) (SEQ ID NO: 10) 
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Genomic DNA was purified from Mucor hiemalis culture mass by the phenol method and 
when genomic PCR (30 seconds at 94°C, 1 minute at 55°C,1 minute at 72°C, 30 cycles) was 
performed, specific amplified bands were confirmed. Regarding p60, with a combination of 
p60-AP-5F and p60-AP-HR primers, a PCR fragment of 1.7kb, with combination of p60-AP- 
5F and p60-AP-6R, a PCR fragment of 1.5kb, and with a combination of p60-AP-6F and p60- 
AP-11R primers, a PCR fragment of 0.2kb was obtained. Regarding this fragment, subcloning 
into pCR-Script Amp was performed using a pCR-Script cloning kit (Stratagene). It was 
predicted from analysis by restriction enzyme digestion, that the amplification fragment of 
p60-AP-5F and p60-AP-llR included the amplification fragments of p60-AP-5F and p60-AP- 
6R, and p60-AP-6F and p60-AP-HR. Thus, the nucleotide sequence of the amplification 
fragment of p60-AP-5F and p60-AP-HR was determined using PRISM Ready Reaction kit 
(Applied Biosystems) and PRISM 3 7 7 DNA sequencer (Applied Biosystems). Gene 
analysis was performed using DNASIS (Hitachi Software Engineering Co., Ltd.) etc. 

As a result, the amplification fragment of p60-AP-5F and p60-AP-HR included the 
determined other partial amino acid sequence. Therefore, it was determined that this fragment 
was a part of the p60 gene. Thus, new DNA primers were constructed based on the inside 
sequence of the PCR amplification fragment, and RT-PCR was then performed (under the 
same conditions as genomic PCR), using Access RT-PCR System (Promega) and with the 
mRNA obtained in Example 5 as a template. The sequences of the newly constructed DNA 
primers are as follows: 

p60-AP-5NF 5' CACTTAAGTCTATGAATGAG 3' (sense primer) (SEQ ID NO: 13) 
p60-AP-6NR 3' CGATAGCTTTAGGTCTCTAA 5' (antisense primer) (SEQ ID NO: 14) 

As a result, a fragment of approximately 1.2kb was amplified. Upon sequencing the amplified 
fragment it was found that a fragment not including introns had been obtained. Therefore, 
cDNA cloning was performed using this fragment as a probe. Labeling was performed with 
a - 32 P dCTP (HOTBq/mmol) using Megaprime DNA labeling systems (Amersham). 



The full-length gene was obtained from the cDNA library, obtained in Example 5, by plaque 
hybridization. As a result, 5 positive clones were obtained from 200,000 plaques. Of these, 4 
clones were subjected to secondary screening thereby obtaining a single plaque. Further, 
E.coli DH10B was infected with phage fluid obtained from the plaque, and a plasmid derived 
from pZLl was recovered from the phage. Restriction enzyme analysis was performed in 
respect of these clones, and nucleotide sequence analysis of the clone comprising the longest 
upstream region was performed. This plasmid was named pZL-Endo. (Fig. 2) 

The nucleotide sequence of the inserted Sal I-Not I fragment of about 2.3kb was determined. 
In other words, the restricted fragments were subcloned into pBluescript II KS+ (Stratagene) 
or pUCl 18 (Takara Shuzo). and by producing a continuous deletion mutant using exonuclease 
HI and mung bean nuclease, a plasmid having a variety of deletion mutations was constructed, 
and the sequence of a Sal I-Not I fragment consisting of 2370bp was determined using a DNA 
sequencer. (Figs. 3-4, SEQ ID NO: 1) 

Analysis of the predicted region of the structural gene was performed and it was found that 
there existed an open reading frame encoding an amino acid sequence consisting from 744 
amino acid residues (deduced molecular weight 85kDa, Figs 5-7, SEQ ID NO: 2), and this 
amino acid sequence comprised all of the determined partial amino acid sequences of p60 and 
pi 4, Since the amino acid next to p60-AP-5 on the N-terminus side was not lysine but 
methionine, it was confirmed that ATG encoding this methionine was the start codon for 
translation. It had therefore been clarified that the N-terminus of the enzyme of the present 
invention was proline. 

On the other hand, as with the result of qualitative analysis, it was found that pl4-AP-3 was 
the C-terminus of the protein encoded by the gene of the present invention. Further, together 
with the results of mass analysis, it was predicted that at least one type of pl4 N-terminus was 
serine, the amino acid at position 628 in the amino acid sequence indicated in SEQ ID NO: 2. 

From the above, it was clear that the 5' region of the gene of the present invention encodes 
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p60, and the 3' region encodes pi 4. Since the N-terminal signal sequence could not be found 
in the amino acid sequence, the enzyme of the present invention is thought to be an 
intracellular protein. However, since there exists a plurality of bands in Figure 1, the enzyme 
of the present invention is thought to be affected by action of a protease due to the lysis of the 
cell. 

Example 7: Construction of the endo- 0 -N-acetylglucosaminidase gene expression vector 

In this Example, an expression vector for Saccharomyces cerevisiae, complementary to a 
TRP1 gene, including an endo- /3 -N-acetylglucosaminidase gene and a GAPDH gene 
promoter-PGK terminator was constructed. 

To obtain the open reading frame encoding the 744 amino acids confirmed in Example 3, 
DNA primers based on DNA sequences equivalent to the amino acid sequences of the N- 
terminus and C-terminus to which a Not I site had been added at both termini, were 
synthesized and PCR was performed with pZL-Endo as a template thereby obtaining an 
amplified fragment. The following are the sequences of the sense and antisense primers: 
Endo-Not-F (sense primer) 

5' GGGGCGGCCGCTTTTATTTTACATAAATATGCCTTCACTTC 3 5 (SEQ ID NO: 

15) 

Endo-Not-R (antisense primer) 

5' CCCGCGGCCGCCTAGTTTAATGACAAATCTATGCTACC 3' (SEQ ID NO: 16) 

After separation by agarose gel electrophoresis, the amplified fragment was recovered and 
purified using Prep-A-Gene DNA Purification System (Bio-Rad). Further, after digestion of 
this fragment with Not I, it was purified and inserted into the Not I of pBluescript II KS+ 
thereby producing pBlue-Endo-Not. 

Since the novel endo- 0 -N-acetylglucosaminidase gene is the gene derived from mold, it was 
thought that expression in yeast was suitable. An expression plasmid for Saccharomyces 
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cerevisiae with trpl gene as a selective marker, said trp 1 gene comprising a Saccharomyces 
cerevisiae glyceraldehyde-3 -phosphate dehydrogenase gene (GAPDH) promoter, a 3- 
phophoglycerate kinase (PGK) gene terminator, and a tryptophan synthesis gene TRP1 gene, 
was constructed with expression vector pG-3 (Methods in Enzymology Vol. 194 p. 389). 
Said pG-3-Not was constructed by digesting pG-3 with BamHI, blunt-ending by Klenow 
processing followed by addition of a Not I linker. 

The above-mentioned pBlue-Endo-Not was digested with Not I and an insertion fragment of 
approximately 2.3kb separated and purified by agarose gel electrophoresis. This fragment was 
inserted at the Not I site of pG-3-Not thereby constructing pGEndo-SC. (Fig. 8) 

Example 8: Expression of novel endo- /3 -N-acetylglucosaminidase in Saccharomyces 
cerevisiae. 

A pep4 gene disrupted strain of the yeast Saccharomyces cerevisiae YPH500 strain 
(Stratagene) was used as a host. A pep4 gene disrupted strain was produced by the method of 
Sikorski, R.S. and Hieter, P (Genetics Vol. 122 19-27 (1989)). The said strain was 
transformed with 10 M g of pGEndo-SC. Transformation was performed by the lithium acetate 
method (see W095/32289) and transformants were selected on a culture plate not including 
tryptophan (yeast nitrogen base 0.67%, casamino acid 0.5%, glucose 1%). 

Confirmation of the activity of intracellular novel endo- 0 -N-acetylglucosaminidase was 
performed in respect of the transformants. For the transformed cells cultured for 2 days at 
30°C in a 5mL YPD medium (yeast extract 1%, polypeptone 2%, glucose 2%), centrifugation 
was performed at 1500g for 5 minutes at 4°C, thereby separating the supernatant and the cell 
mass. The cell mass was washed with distilled water. Ten microliters of a mixture of 50mM 
potassium sulfate buffer (pH 6.0) and 5mM EDTA was added to the cell mass and suspended 
well. Further, 50mg of glass beads was added and after vigorous stirring, centrifiiged, and the 
supernatant taken was as the cell extract. 
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Measurement of activity was performed by TLC or HPLC using DNS-GP as a substrate. 
Results with TLC are indicated in Figure 9. As with the sample reacted with the enzyme 
purified from the supernatant of the Mucor hiemalis culture, a peak identical with the 
dansylated asparagyl acetylglucosamine (DNS-Asn-GlcNAc) was obtained from the pGEndo- 
SC product. On the other hand, no peak corresponding to DNS-Asn-GlcNAc was detected 
from the culture supernatant of the negative control strain transformed with pG-3-Not. The 
pGEndo-SC cell extract was concentrated by a factor of ten, desalted, reacted with DNS-GP 
and the peak corresponding to DNS-Asn-GlcNAc was fractionated using HPLC under the 
above-mentioned conditions. The fractionated samples were concentrated with an evaporator, 
and mass spectrometry analysis was performed thereon. As a result, it was confirmed that the 
mass spectrometry results matched with the DNS-Asn-GlcNAc results. Thus it was apparent 
that the gene product encoded by the pGEndo-SC insertion fragment was a novel endo- # -N- 
acetylglucosaminidase. 

Table 3 indicates the activity (product amount) per liter of the culture of the novel endo- /3 -N- 
acetylglucosaminidase of this invention. This activity was 48 times the value for Mucor 
hiemalis. 



TABLE 3 Novel endo- $ -N-acetylglucosaminidase activity 



Activity (Unit/Liter) 


M. hiemalis culture supernatant 


0.9 


Culture fluid of 5. cerevisiae into which 


43.2 


novel endo- 0 -N-acetylglucosaminidase gene 




was introduced* 





* After cell collection from the culture, the cells were disrupted with glass beads. After 
separation of the supernatant using a centrifuge, the activity of the supernatant was measured, 
and the activity per culture volume was calculated from this value. 

This specification incorporates the content described in the specification and/or the drawings 
of Japanese Patent Application No. 10-141717 which is the application upon which priority 
for the present application is based. 



All publications, patents and patent applications cited herein are incorporated herein by 
reference in their entirety. 

Industrial Applicability 

The present invention provides an endo- 0 -N-acetylglucosaminidase, an endo- 0 -N- 
acetylglucosaminidase gene, a recombinant plasmid which comprises said gene, an organism 
transformed with said plasmid, and a method of producing endo- 0 -N-acetylglucosaminidase. 

By introducing a vector which comprises the gene of the present invention into a host 5 and 
causing the gene to be expressed, endo- 0 -N-acetylglucosaminidase can be produced 
efficiently and in large quantities. 

The enzyme of the present invention is an industrially important enzyme in the analysis and 
assay of sugar chains and the modification of sugar chains. The transformants obtained by the 
present invention produce the subject enzyme in large quantities and make a large 
contribution to industries that employ these enzymes. 

Free Text of Sequence Listing 

SEQ ID NO: 4: A partial amino acid sequence of endo- 0 -N-acetylglucosaminidase 

SEQ ID NO: 5: An oligonucleotide designed from a partial amino acid sequence of endo- 0 - 

N-acetylglucosaminidase 

SEQ ID NO: 5: n indicates a, g, c, or t (Location: 12) 

SEQ ID NO: 6: A partial amino acid sequence of endo- 0 -N-acetylglucosaminidase 

SEQ ID NO: 7: An oligonucleotide designed from a partial amino acid sequence of endo- 0 - 

N-acetylglucosaminidase 

SEQ ID NO: 7: n indicates a, g, c, or t (Location: 6) 

SEQ ID NO: 8: An oligonucleotide designed from a partial amino acid sequence of endo- # - 
N-acetylglucosaminidase 
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SEQ ID NO: 8: n indicates a, g, c, or t (Location: 15) 

SEQ ID NO: 9: A partial amino acid sequence of endo- 0 -N-acetylglucosaminidase 

SEQ ID NO: 10: An oligonucleotide designed from a partial amino acid sequence of 

endo- 0 -N-acetylglucosaminidase 

SEQ ID NO: 11: An oligonucleotide sequence of the 5' terminal region of an endo- 

0 -N-acetylglucosaminidase gene 

SEQ ID NO: 12: An oligonucleotide sequence of the 3* terminal region of an endo- 

0 -N-acetylglucosaminidase gene 

SEQ ID NO: 13: An oligonucleotide designed from the nucleotide sequence of an 

endo- 0 -N-acetylglucosaminidase gene 

SEQ ID NO: 14: An oligonucleotide designed from the nucleotide sequence of an 

endo- 0 -N-acetylglucosaminidase gene 

SEQ ID NO: 15: An oligonucleotide designed from the nucleotide sequence of an 

endo- 0 -N-acetylglucosaminidase gene 

SEQ ID NO: 16: An oligonucleotide designed from the nucleotide sequence of an 

endo- 0 -N-acetylglucosaminidase gene 



SEQ ID NO: 20 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 21 
SEQ ID NO: 27 
SEQ ID NO: 28 
SEQ ID NO: 33 



Xaa indicates Met or Ser (Location: 2) 
Xaa indicates Gly or Met (Location: 2) 
Xaa indicates Gin or Ala (Location: 3) 
Xaa indicates Arg or Leu (Location: 4) 
Xaa indicates Asn or Pro (Location: 6) 
Xaa indicates Arg or Leu (Location: 8) 
Xaa indicates Glu or Leu (Location: 9) 
Xaa indicates Ser or Leu (Location: 10) 
Xaa indicates His or Thr (Location: 11) 
carboxymethylcystein (Location: 3) 
carboxymethylcystein (Location: 6) 
carboxymethylcystein (Location: 8) 
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